High performance digit recognition in real car environments

نویسندگان

  • Umit H. Yapanel
  • Xianxian Zhang
  • John H. L. Hansen
چکیده

In this paper, we consider the problem of robust digit recognition in real car environments. We choose to utilize newlycollected CU-Move database [2]. We address the problem using two integrated approaches . First, we consider array processing, enhancement and noise adaptation techniques as an integrated solution. This approach reduced the word error rate (WER) 38.6% and increased word accuracy (WAC) 47.1%, relative to baseline results. Secondly, we use array processing, enhancement, cepstral mean normalization, vocal tract length normalization and MLLR adaptation as an alternative solution. The net gain obtained with this solution is 55.4% reduction in WER and 64.3% increase in WAC, relative to baseline results. The first approach has the advantage of speed since all operations can be performed in real-time, while the second approach maintains high accuracy at the cost of increased computational requirements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition

This paper introduces a common database and an evaluation framework for connected digit speech recognition in real driving car environments, CENSREC-2, as an outcome of IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. Speech data of CENSREC-2 was collected using two microphones, a close-talking microphone and a hands-free microphone, under three car speeds and four car conditions...

متن کامل

Real Time Implementation of a License Plate Location Recognition System Based on Adaptive Morphology

License plate recognition (LPR) by using morphology has the advantage of resistance to brightness changes; high speed processing, and low complexity. However these approaches are sensitive to the distance of the plate from the camera and imaging angle. Various assumptions reported in other works might be unrealistic and cause major problems in practical experiences. In this paper we considered ...

متن کامل

Evaluation of a Noise Adaptive Speech Aurora 3 Datab

In this paper, we present evaluation results of a noise adaptive speech recognition system with combination of several techniques for robust speech recognition. The evaluation was on AURORA 3 database which contains noisy digit utterances collected in real car environments through close-talking and hands-free microphones. The techniques in the system include segmentation, maximum likelihood lin...

متن کامل

Reduced complexity equalization of lombard effect for speech recognition in noisy adverse environments

In real-world adverse environments, speech signal corruption by background noise, microphone channel variations, and speech production adjustments introduced by speakers in an effort to communicate efficiently over noise (Lombard effect) severely impact automatic speech recognition (ASR) performance. Recently, a set of unsupervised techniques reducing ASR sensitivity to these sources of distort...

متن کامل

Evaluation of real-time audio-visual speech recognition

In this paper, we propose and develop a real-time audio-visual automatic continuous speech recognition system. The system utilizes live speech signals and facial images that collected from a microphone and a camera. Optical-flow-based features are used as visual feature. VAD technology and lip tracking are utilized to improve recognition accuracy. In this paper, several experiments are conducte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002